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Abstract 

In this paper, we develop a computational approach for estimating the mean value of a 
quantity in the presence of uncertainty. We demonstrate that, under some mild assumptions, 
the upper and lower bounds of the mean value are efficiently computable via a sample reuse 
technique, of which the computational complexity is shown to posses a Poisson distribution. 

1 Introduction 

In many situations, it is desirable to estimate the mean value of a scalar quantity Q which is a 
function of independent random vectors V and A such that the distribution of V is known and 
that the distribution of A is unknown [3]. Namely, it is interested to estimate the expectation 
of Q = q(V,A), where (?(.,.) is a multivariate function. From modeling considerations, it is 
reasonable to assume that A is bounded in norm |.||. and radially symmetrical and nondecreasing 
in its probability density function, /a.(-) with the following notions: 

(i) The norm, \A\\. of A is no greater than a certain value r, i.e., |^|| < r; 

(ii) For any realization A of A, f^(A) depends only on, \\A\\, the norm of A; 
(hi) For any At and A 2 such that ||AJ| < HA2II, fM A i) > fji(A 2 ). 

Such assumptions have been proposed by Barmish and Lagoa pQ in the context of robust- 
ness analysis of control systems, where A is referred to as "uncertainty" because of the lack of 
knowledge of its distribution. 

In this paper, we shall focus on the estimation of the expectation E[Q] = E[g(V, A)] based on 
assumptions (i), (ii) and (hi). Such a problem is referred to as robust estimation due to the fact 
that the exact distribution of A is not available. In the special case that the maximum norm r of 
A equals 0, the robust estimation problem reduces to a conventional estimation problem. Instead 
of seeking the exact value of E[Q] which is obviously impossible, we aim at obtaining upper and 
lower bounds for E[Q]. It is intuitive that the gap between the upper and lower bounds should be 
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increasing with respect to r. Since the relation between Q and V, A can be fairly complicated, 
the Monte Carlo estimation method is the unique and powerful approach. 

The remainder of the paper is organized as follows. In Section 2, we derive upper and lower 
bounds for E[Q] based on assumptions (i), (ii) and (iii). In Section 3, we propose a Monte Carlo 
method for the evaluation of the bounds of E[Q]. In particular, we introduce a sample reuse 
method to substantially reduce the computational complexity. In Section 4, we investigate the 
computational complexity of the Monte Carlo method implemented with the principle of sample 
reuse. Section 5 is the conclusion. 

2 Bounds of Expectation 

In this section, we shall derive upper and lower bounds of E[Q] = E[g(V, A)] based on the 
assumptions described in Section 1. For this purpose, we have the following fundamental result, 
which is a slight generalization of the uniform principle proposed by Barmish and Lagoa pQ. 

Theorem 1 Let A^ be a random vector with a uniform distribution over {A : ||Z\|| < p}. Define 

M(p) = E \q(V, A™)] , M(r) = inf M(p), M(r) = sup M(p). 

0<p<r 0<p<r 

Then, M(r) < E[Q] < M(r). 

See Appendix A for a proof. Theorem 1 reveals that the computation of the bounds of E[Q] 
can be reduced to the evaluation of function M(p), which can be accomplished via Monte Carlo 
simulation. A conventional method is as follows: 

Partition interval (0, r] by grid points r = pi > pi > ■ ■ ■ > p m > 0. Let 96 i = {A : \ \A\ \ < pi\. 
For £=!,■■■ , m, estimate M.(pi) as the empirical mean 

N 

where Vi, X^, i = 1, ■ ■ ■ ,N are mutually independent random variables such that V±, ■ ■ ■ ,Vn 
are i.i.d. random samples of V and Xg t i, • • • , Xi^ are i.i.d. random samples uniformly distributed 
over 9$i. Clearly, the total number of simulations is Nm for estimating M(p^), I = 1, ■ • • , m. A 
major problem with this approach is that the computational complexity can be extremely high, 
since the number of grid points m is typically a very large number. To overcome such a problem, 
we shall develop a sample reuse technique in the next section. 

3 Sample Reuse 

In this section, we shall explore the idea of sample reuse to reduce the computational complexity. 
The sample reuse method has been proposed by Chen et al. [21 [3] for the robustness analysis 
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of control systems. The idea of sample reuse is to start simulation from the largest set SB\ and 
if it also belongs to smaller subsets the experimental result is saved for later use in the smaller 
sets. As can be seen from last section, a conventional approach would require a total of Nm 
simulations. However, due to sample reuse, the actual number of experiments for set SB a is a 
random number n^, which is usually much less than N. Hence, this strategy saves a significant 
amount of computational effort. 

In order to provide a precise description of the principle of sample reuse, we assume that 
all random variables are defined in the same probability space (f2,^,Pr). We shall introduce a 
function Sf, referred to as sample reuse function, as follows. 

Let Xi,-- - ,X m be i.i.d. samples uniformly distributed over si '. Let Y\,--- ,Y n be i.i.d. 
samples uniformly distributed over SB. Let m < n and D SB. Define reusable sample size k 
such that k(6<j) is the number of elements of {Xi(ui) G SB : i = 1, • • • , m} for any u G CI. Define 
random variables Z\, ■ ■ ■ , Z n such that, for any u> G Q, 



where i£, 1 < £ < k(cj) are the indexes of the elements of {Xi(uj) G SB : i = 1, • • ■ ,n} such that 
il is increasing with respect to I. This process of generating Z±, - ■ ■ ,Z n from X±, ■ ■ ■ , X m and 
Y\ , ■ ■ ■ , Y n is denoted by 



With regard to the distribution of Z\ , • • • , Z n , we have 

Theorem 2 Suppose X\, ■ ■ ■ , X m are independent with Y\, ■ ■ ■ ,Y n . Then, Z\, ■ ■ ■ ,Z n are i.i.d. 
samples uniformly distributed over SB. 

See Appendix B for a proof. Now we can use 5f to precisely describe the sample reuse algorithm 
for estimating M(p^), I = 1, • • • ,m. Let JQj, i = 1, • • • , N be the random samples uniformly 
distributed over SBi for I = 1, ■ ■ • , m. Let Y\^ = Xij for i = 1, • • ■ , TV and (Y^i, • • • , Y^\ k^) = 
5f(Y^_i 5 i, ■ ■ • , y^_i ) jv; ■ ■ • > X^ff) for I = 2, ■ ■ ■ , m. As a result of Theorem 1, we have that, 
for any £ G {1, • • • , m}, random variables Ya, i = 1, • • ■ , N have the same associated cumulative 
distribution with that of random variables X^, i = 1, • • ■ , N. This implies that sC^ii ^,«) 

has the same distribution as that of X}i=i sO^ii for £ = 1, • ■ ■ , m. Therefore, we can use 
5Zi=i <z(^«> as an estimator of M(/?£) for £ = 1, ■ • • , m. By virtue of such sample reuse 
method, the total number of simulations is reduced from Nm to N + *Y^i=2 n ^> w here = A r — k^ 
for £ = 2, • • • , m. As will be demonstrated in the next section, this can be a huge reduction of 
complexity for a large m. 




X ie {u) for 1 < ^ < k(w), 
Yt(u) ioT~k(uj) <£<n 



(Zi,--- 



Z n ;k)=W(X 1 ,--- ,X m ;Y lr -- 
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4 Poisson Complexity 



Since the total number of simulations for using the sample reuse method to estimate M(/%), I = 
1, • • • , m is N + ^ * s important to investigate the distribution of *Y^i=2 ^ n this re g ar d, 

we have the following general result. 

Theorem 3 For arbitrary sequence of nested sets 83\ D £$2 D • • • D with vol(^i) = V mSLjX 
and vol(£$ m ) = V m i n , the cumulative distribution function ofY2e!=2 n ^ * s bounded from below by the 
cumulative distribution function of a Poisson random variable P with mean \ = Nln(j(^y That 
is, Pr{X^2 n ^ — 0} — Pr{P = 0} and Pr {J2T=2 n £ — ^1 > P r {P < k} for any positive integer k. 
Moreover, as the maximum difference of volumes of all consecutive sets tends to be zero, Y^£=2 n £ 
converges to P in distribution. 

See Appendix C for a proof. It should be noted that the volume of a set 6W, denoted by 
vol(^), is referred to the Lebesgue measure of 88 in this paper. 
As an immediate consequence of Theorem 3, we have 

{m \ ( m ~\ 

Y^ n t >°f =Pr{P>0}, Pr \ > k I < Pr{P > jfc}, fc = 1, 2, ■ ■ ■ 

which implies that 



E 



5> £ 

1=2 . 



00 ( m ~\ oo / M \ 

^ Priori, >k \ <^Pr{P>fc} = A = iVln(^) 



fc=0 1 1=2 ) k=0 

By virtue of Theorem 3, we can derive some simple bounds for the distribution of *Y^i=2 n £ as 
follows. 

Theorem 4 Pr{^^=2 n ^ — ^} — e_A (if)' f or an y num ^ er k > X = Nln C^*^ ■ In particular, 
^{J2T=2 n i > eA ) < e ~ A and Fl i x > C 1 + e)A} < exp (-^) /or < e < 1. 

See Appendix D for a proof. 

Now we apply Theorem 3 to investigate the density of original samples of A. Suppose that 
the volume of {A : \\A\\ < p} is proportional to p d where d is the dimension of the set. Let N p 
denote the number of original samples included in {A : \ \A\ \ < p} when applying the sample reuse 
method to interval p\. Define the density of samples at radius p as V(p) = lim^o — — '—j- 
Then, we have the following result. 



Theorem 5 V(p) is equal to ^ (^) d for p € (0, f ] and is less than ^ for p E (f , a}. 

See Appendix E for a proof. From this theorem, we can obtain an upper bound for the 
expected number of original samples with norm bounded in [0, a]. As can be seen from Theorem 
5, the density function is unimodal and achieves the largest value at p = ^. The density function 
is displayed by Figure [TJ 
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Density of Samples (N = 100, lambda = 10, a = 100) 



d = 1 




Radius t 



Figure 1: Illustrative Example (N = 100, a = 100, A = 10) 

5 Conclusion 

We have proposed an efficient computational approach for estimating the mean value of a random 
function, for which the distribution of relevant random variables are not completely available. 
A Monte Carlo method with sample reuse as a key mechanism is established. The associated 
computational complexity is demonstrated to follow a Poisson distribution. 



A Proof of Theorem 1 

We follow the similar method of Barmish and Lagoa pQ. Let V denote the volume of B = {A : 
\\A\\ < r}. We partition the set B as K layers of equal volume such that the fc-th layer is 
Ck = {A : r,t_i < \\A\\ < rfc} with = ro < r\ < r<i < ■ ■ ■ < rx = r. Then, the density function 
can be expressed as 

K 

/ A (A)«^I fc (A)A fc 

k=l 

where A^, k = 1, • • • , K satisfying 

V K 

-]TA fc = l, A!>A 2 >--->A^>0 (1) 

k=l 
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and Ifc(.) is the indicator function such that Ife(A) = 1 if A falls into the A:-th layer C k and 
Ife(A) = otherwise. Let fv(-) denote the density function of V. Since V and A are independent, 
we have 



E[q{V,A)] « / q(v,A) f v (v)dv U(A)dA 

/{(«,A):||A||<r} 

K 



L 



,k=l 



dA = y^a k X kl 



q{v,A) f v (v)dv 

'{(t>,A):||A||<r} 

where a k = Jrr v A)-||A||<r} l( v > ^) ^k(A)fv{v)dv dA. Therefore, the upper and lower bounds of 
E[g(V, A)] correspond to the maximum and minimum of the linear program: ^2 k= i a k^k subject 
to constraint ([T]). Prom convex analysis, the maximum and minimum of this linear program are 
achieving at extreme points of the form: 



k=l 



^ forl<fc<j, 



for j < k < K. 

As the number of layers K tends to infinity, the summation 5Zjfc=i Ifc(A)Afc, which is associated 
with extreme point (Ai, • • • , \k), tends to a uniform distribution. This justifies the theorem. 

B Proof of Theorem 2 

Let Si C M for I = 1, • • • , n. Define D = {1, • • • , re} and l s = {(h, ■ ■ ■ , i s ) '■ i\ < • • • < i s \ U £ 
D, £ = !,-■■ ,s}. Then, 

n 

PT{Z t eSt,£=l,--,n} = PT{X»eS t , i=l,---,s; X^S, jeB\{i lr •-,{.}} 

xPr{Y e G Sg, 1 = s + !,-■■ ,n}. 



For simplicity of notations, we let Vs e = vol(Si), = vol(j^) and Vgg = vo\{3§). Note that 
Pr{y, 6 S t , I = s + 1, ■ ■ ■ , re} = n" =s+ i (H) and 

11 I V* 



Pr{X lf G £ = !,-■■ ,s; Xj j G B\ {h, ■ ■ ■ ,i s }} 



1=1 



Since there are (") elements in X s , we have 
Vr{Z e g Si, 1= 



> - sC)^O-^rn(t) ,n(t 



n(t)£(:)(F^ 



n 



This concludes the proof of the theorem. 



C Proof of Theorem 3 

We need some preliminary results. 

Lemma 1 Let N\ < N2 < • • • < N m . For £ = !,■■■ ,m, let ve = vol(^) and X^i, i = 1, • ■ ■ ,Ng 
be i.i.d. random samples uniformly distributed over SSg. Let Y\% = Xn for i = 1, • • • ,N\ and 
,Yi jNt ;k t ) = W(Y e _ liU ■ ■ ■ ,Y e _ ltNe ^; X £tl , ■ ■ ■ ,Xg }Ne ) fori = 2, • • • ,m. Define n e = 
N t - ki for £ = 2, • • • , m. Then, Pr{n, = n e , I = 2, ■ ■ ■ ,m} = UT=2 B ( N t ~ A^-i, ^) for 
Ng - N t -i < ng < N e and 2 < i < m, where B(k,n,p) = (l)p k (l - p) n ~ k . 

Proof. We use induction method. First, it is easy to show that the lemma is true for m = 2. 
Next, we assume that the lemma is true for m — 1 and show that the lemma is also true for m. Let 
Pr{(fci, • • • , k m ), (TVi, • • • , N m ), (vi, ■ ■ ■ , v m )} denote the probability that, among the N± samples 
generated from the biggest set 3$\, there are kg samples falling into 3$i for £ = 1, 2, • • • , m. Let 
P m {(«2j • • • ,n m ), (iVi,-- - ,N m ), («!,••■ ,v m )} denote the probability of event {n^ = rig, i = 
2, • • • , to} associated with the application of the sample reuse method to sets £3g, t = 1, • • • , to 
with required sample sizes N\ < N2 < • • • < N m . Let P m_1 {(n3, • • • , n m ), (N2 — k 2 , • • • , N m — 
km), {v2, ■ • • ,v m )} denote the probability of event {n^ = ng, t = 3, • • • ,m} associated with the 
application of the sample reuse method to sets 98 1, I = 2, • • • , to with required sample sizes 
N 2 - k 2 < ■ ■ ■ < N m - k m . Note that 

P m {(n 2 ,--- ,n m ), (iV"i , • • • ,N m ), («!,-•• ,v m )} 

Pr{(fci,--- ,km), (Ni,--- ,N m ), («!,•■. ,u m )} 

fe 2 >fc 3 >->fc m >0 

x P m_1 {(n 3 , ■ • • , n m ), (N 2 - k 2 , N 3 - k 3 , ■ ■ ■ , N m - k m ) , (v 2 , • • • , v m )} 
where n2 + k 2 = N2 and k\ = N±. By the mechanism of sample reuse, 

'ki-i\ fvi-i — v e 



Pr{(fcx,-- - ,k m ), (JVi,-- - ,iV m ), (ui,--- ,u m )} 



n 

i=2 



kg I V 



fe„ 



"1 



Since and — A;^ are non-decreasing with respect to £, we have that Ne — kg is non-decreasing 
with respect to I. Hence, by the assumption of induction, 

pm_1 {("3, • • • , n m ), (N 2 - k 2 , N 3 ~k 3 , • • • , 7V m - km) , (v 2 , ■ • • , 

m , v 

= ]B JVf-Tlf-i^M^f-l,— 
£=3 ^ 



e=3 
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and consequently, 



P m {(n 2 ,--- ,n m ), {N x ,--- ,N m ), {v ir -- ,v m )} 



E 



fc 2 >fc 3 >--->fc m >0 



n 

1=2 



n 

1=3 



N(,-n t - kj \vt-i 



kt-i\ ( ve-i — vt 

Vl 

Ni—ni — ki 



1 



E 



fe 2 >fe3>-->fc m >0 
\ km TTl 

v t) n 



n 

.1=2 



ki ) \N e -ne-ki 



ve-i ~ vt 



kz-i—ki m 



n 



ve-i 



Vf 



I'm 
Vl 

Ni_i —N£-\-n£—ki_i-\-kt 



Vt-1 



Ni—ng — kz 



i 



e=2 K ' t=3 

Making use of the relationships k± = Ni and ki = N2 — ri2, we have 



vt 

Vt-l 



—N£-\-ri£—k£ — i-\-ki 



km m 



if) n 



vt-i - vt 

Vl 



kt-i—kt m 



n 



(vi-v 2 f^ l\ 



x 

t=3 

N 2 —U2 



Vt 
Vt-l 



— kg 



1 



vt 

Vt-l 



Vl 



N 1 -N 2 +n 2 



and thus 



_m 
Vl 

v m 

Vl 

m 

n 

^=3 



k m m / \ ki—T—kt m 

Ve-1 - v e 



n 

1=2 

k m m 

n 

^-1 



''1 



x 



Vl 



t=3 

ki-i — ke m 
X 

1=3 



n 

m 

n 



Ni—ni 



Vt 
Vt-l 



vt-i 

Ni-x-Ni+ni 



n 



^-1 



Ng—ne 



Vt 
Vt-l 



Nt-i—Nt+nt 



Nc—rip — ki 



V( 



Ni—i — N# -\-ri£ — fef—i+fc^ 



ke-i+ki 



On the other hand, if the lemma really holds, we have 



P m {(n 2 ,--- ,n m ), (JVi,--- ,JV m ), («!,-.. ,t> m )} 



£=2 



^-1 



n 



n 



Therefore, to show the lemma, it remains to show 



E 



fc2>fe3>-">fc m >0 Lt=2 



n 



h ) \Nt -nt — ki 



n 

1=2 



N t -i 



Ni-x—Ni+ni 



Using the relationships k\ = N\ and ki = N2 — ri2, this identity can be reduced to the following 
identity 



k 2 >k 3 >--->k m >0 



n 



/ Ni-i - kt-i 
h J \ N e -n e -k e 



n 

«=3 
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which can be shown by observing that 



e n 

*i3>fc3> — >*m-<>0 

e n 

fc 2 >fc 3 > - >fe m -i-i>o L e=3 

~m — i—l 



ke-i\ ( N e _ 1 - k e - 



1=3 

~m—i—l 



h J \Nt -ri£-k 

( Ne-i - ke-i 



E 



n 



fc2>*!3> — >fcm-*-l>0 L 1 = 3 

for < i < m — 4 and 

3 



k e J \N e -rit- h 
ki-i\ ( Ni-i - h-i 



E 



J fe m _ i= 



km—i—l\ ( ^m — i—1 k 



ke J \Nt - Tit — he. 



E 

fc2>fc3>0 



n 

.1=3 



ke-i \ ( Nt-i - ki-i 



ke J \N e - n e - kg 



E 



k 2 >k 3 >0 



This completes the proof of the lemma. 



m—i — l 



"•"m—i ) m — i ^rn — i ^m — i 



JVm-i-l 
■^m—i T^m—i 



k 2 \ ( N 2 - k 2 
k 3 ) \n 3 - n 3 - h 



N 2 
N 3 - n 3 



□ 



Lemma 2 Let 9 > 1 and N > 1. Define L(6,k) = £*L (?) (l - \ 



\\i i \\ N—i 



and Lp(9, k) 



ELo {N y Y exp(-iVlng) for k = 0,1,- •• ,N. Then, L(0,O) = L P (9,0) and L{9,k) > L P (9,k) 
fork = !,-•• ,N. 



Proof. First, it is evident that L(9,0) = L P (6,Q) = 9~ N and L(9,N) = 1 > L P (6,N). Hence, 
it remains to show the lemma for k = 1, • ■ ■ , N — 1. It is easy to show that lim^oo L(#, fe) = 
lim^oo L P (9, k) = and thus limg^oo [1/(6*, fc) — L P (6, k)\ = for k = 1, ■ ■ ■ , N — 1. It can also 
be readily checked that lim^i L(#, fc) = lim^i L P (9, k) = 1 and consequently lini0^i[L(#, fc) — 



MM)] 

dL P (9,k) _ 
06 _ 



1 



and 



for fc = 1, • • • , N - 1. Noting that «M = _ _ 

"(JV In Of - {N ( - 1 lf_~^ k ] > if and only if 



(Nine)" N 



fc! 



^.wehave ^V"' 



JV 



ip{6) > 0, where (^(0) = h\9-a(9-l) with a = 



~~ k\e N + 1 
(N-iy. 



* ± < 1. Since yj(l) = and ^ 



n 



(JV-fc-l)! 

is positive for 9 G (l, we have <p(0) > for 9 G (l, ^] . Since <£> (^) > 0, lim^oo <p(0) < and 
dip }P < for 9 > —, there exists a unique number 0* greater than — such that (f{9*) = 0. Hence, 
93(0) is positive for 9 G (1,0*) and negative for > 9*. This implies that L(9,k) — L P (9,k) is 
monotonically increasing with respect to 9 G (1,9*) and monotonically decreasing with respect to 
9 G (0*, 00). Recalling that lim^i[L(6 l , /c) - L P (6>, &)] = lim e _ >00 [L(6', fe) - L P ((9, A;)] = 0, we have 
L(0, k) > L P (9, k) for any > 1. This completes the proof of the lemma. 

□ 



Lemma 3 Let Ui, Vi, i = 1, • • ■ , n be mutually independent non-negative discrete random vari- 
ables. Suppose that Pr{[/j = 0} = Pr{Vi = 0} and Pr{[/j < k} > Pr{Vi < k} for any positive 
integer k and i = 1, ■ • ■ , n. Then, Prj^ILi Ui = 0} = Pr{El=i ^» = anc ^ P r {Er=i — ^} > 
-P r {Er=i ^1 — ^} / or an y positive integer k. 
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Proof. We use induction method. The lemma is obviously true for n = 1. Assuming that the 
lemma is true for n = m — 1 > 1, we have Pr-E"ii Ui = 0} = Pr{X)£!_i U l = 0, U m — 0} = 
PrlE^T 1 ^ = 0} Pr{^„ = 0} = Pr{Eti Vi = 0} and Pr{£™ : C/, < k} = £* =0 Pr{E^ C = Cm < 
fc — /} > J2i=o P'^iSllT 1 K = ^} P T {V m < k — 1} = Pr{X)"=i ^< ^ ^} for any positive integer fc, which 
implies that the lemma is also true for n = m. By the principle of induction, the lemma is 
established. 

□ 

We are now in a position to prove the theorem. We shall first show that the distribution of 
YlT=2 n £ * s bounded from below by the distribution of a Poisson variable with mean Nln^^. 
Define Ui = rij+i for % = 1, • • • ,m — 1. Then, by Lemma [IJ U are independent binomial random 
variables such that Pr{£/j < k} = L(6i,k) for k = 0, 1, ■■■ ,N and i = , m — 1, where 

0j = Define Poisson variables V^, i = 1, ■•• , m — 1 such that f/j, V^, i = 1, ■ • • , m — 1 

are mutually independent and that Pr{V^ < A;} = Lp(9i,k) for non-negative integer k and i = 
1, • • • , m-l. By Lemmas E] and O we have Pr{£™ 2 n e = 0} = Prf^™" 1 C/i = 0} = PrlX™" 1 V i = °1 
and Pr{Efc2 n « < k ) = P^E^C < k} > Pr{Y^i l Vi < k} for k = 1, 2, • • • . Noting that 
^l) ■ " " ; ^m-i are independent Poisson variables with corresponding means N In 9±, ■ ■ ■ , iVln 6 m -i, 
we have that E^ii" 1 Vi is also a Poisson variable with mean TV E2=i = N E^ii" 1 m i7^7 = 
JVln^- = iVlniW*. 

"min 

Next, we shall show that the distribution of E^=2 n ^ tends to be the distribution of a Poisson 
variable with mean iVln-p™ as v = max{^ — vg+i : 1 < £ < m — 1}, the maximum difference 
between the volumes of two consecutive nested sets, tends to be zero while the volumes of 8$\ 
and 0B m respectively assume fixed values v\ = V max and v m = V m i n . 

Since all sample sizes are equal to N, by Lemma [H for £ = 2, • • • , m, the original sample sizes 
ii£, £ = 2, • ■ ■ ,m are mutually independent binomial random variables such that Pr{n^ = k} = 
B(k, N,p e ) for < k < N and 2 < £ < m, where p e = 1 - ^ with v e = vol(^). Therefore, the 
moment generating function of Y1T=2 n ^ can ^ e expressed as G(s) = [11^=2 (p? s + 1 — Pi)] N i where 
s G (0, 1] is a real number. Since p#s + 1 — pg is positive for any s G (0, 1] and £ = 2, • • • , m, it is 
meaningful to define g(s) = E2=2 m (^ s + 1 — Pf) f° r s S (0,1]. Hence, G{s) = exp(Ng(s)). 
For simplicity of notations, define h(s) = (s - l)ln(^) , h(s) = f° Y7=2 ^z^f+^ dz - 
Jo T.7=2 zi :;-- v V j +v dz and I 2 {s) = ! S Q Et2 V ~H^dz - £ YTU V -H^dz. The lemma can be es- 
tablished by the following three steps. 

First, it can be seen that g(s) = Ii(s) for any s G (0, 1], since ii(l) = g(l) = and 

dhjs) _ -A ve-i-ve _ -A _ dg(s) 

— ^ ef<9]/) -, — ^ ? /; ^ -I- tie — ^ 



rfs ^ s(v £ _i - + v e j^pes + l-pe ds 



for any s G (0, 1]. 
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Second, we need to show that \Ii(s) — l2(s)\ — > for any s £ (0, 1] as v — ► 0. Noting that 



z(vt_i - v e ) 2 
o vj + z(vi-i - vi)vi 



dz 



(Vj-l - Vjf 



dz 



< 



{vi-x - v e ) 2 f s z A _ _ s 2 (v i ^ 1 - v e ) 2 < s 2 v(v£-i - v e ) 



Vf 



— dz = 

o v £ 



2v 2 



2VI 



for any s G (0,1], we have 

m 

\h{s)-I 2 (s)\ < 



1=2 



z(ve-i - ve) 2 
o v\ + Z{V£-1 - v e )vi 



dz 



E 



'.{vt-x - v t f 



o vj + z(v £ -i - v e )v e 



dz 



* E 

1=2 

(s 2 + l)u 



s 2 u(ve-i - ve) ^ v(vg- X - v^ 



2V ■ 

mm 



+E 



2V 2 - 

_2 min 



2V 2 - 

mm £—2 



(s 2 + l)(V max -V ain )v 



2V 2 - 

mm 



Therefore, \Ii{s) — l2{s)\ — > for any s € (0, 1] and arbitrary vg, I = 1, • • • , m, as v —* 0. 
Third, we need to show g(s) — > h(s) as v — > 0. Since 



h(s)-I 2 (s) = 



In 



V m 



Vn, 



E 



Vi-l - Vi 



1=1 



dz — 



In 



V m 



E 

1=2 



Vt-1 



Vf 



we have |/ 2 ( S ) - h(s)\ < /' |ln (^) - EL^I^ + J? 



In 



Em 
£= 



2 i; f 



<iz, 



dz. By the 



definition of Riemann integration, Y^t= 2 Ve l e Ve ~ * Jy""" ^ = ^ n ( v"^) as v ~* ^ ^ or arbitrary 
«f , £ = 1, • • • , m. It follows that, for any s € (0, 1] and arbitrary vi, £ = 1, • • • , m, | J2 (s) — h(s)\ — > 
Oasz/^0. In view of |^(s)-ft(s)| = |/ 2 (s) - M«) + h(s) ~h(s)\ < \I 2 (s) - h(s)\ + \h(s) - I 2 {s)\, 
we have g(s) — » /i(s) as z/ — > for any s G (0,1] and arbitrary i>£, I = 1, • • • ,m. Therefore, 
we can conclude that G(s) — > exp f(s - l)iVln (v*?*)) as v — > for any s E (0, 1] and arbitrary 
V£, I = 1, • • ■ , m. This proves that n ^ converges in distribution to a Poisson variable of 

mean iVTn (^^j ■ The proof of the theorem is thus completed. 



D Proof of Theorem 4 

We need some preliminary results. 

Lemma 4 Let X be a Poisson variable of mean A > 0. For any number k > A, Pr{AT > k} < 

Proof. Since Pr{X > k} = Pr [e 1 ^'^ > 1} < E [e*^"^] for any t > 0, we have Pr{X > k} < 
mU >0 E[e^ x ^]. Note that 



E 



t{i-k)^_-\ _ Xe'-X-tk 



i=0 i=0 



E 



(Ae 



f v 
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which is minimized if and only if Ae* = k. Since k > A, we have t = In (j) > such that 
Xe t = k. For 



Ae* = k. For this value of t, we have e A e Ae * tk = e x (^r) fc . Hence, we have shown Pr{AT > 



□ 



Now we are in a position to prove the theorem. By Theorem 3, we have Pr{2~^^l2 n ^ — ^} — 
Pr{X >k}< e~ x (^) . Setting k = eX, we have Pr{XX 2 n f > eA} < e~ x . Moreover, using the 

inequality (1 + e) ln(l + e) > e + ^, Ve € (0, 1], we have Pr{XX 2 n<? > (1 + e)A} < 

exp ( — ^tM for < e < 1. This completes the proof of the theorem. 



A 

< 



E Proof of Theorem 5 

By Theorem 3, we have E[N p+s ] < N 1 + din • Now fix the gridding over (p, p + 5}. By 

Theorem 3, as the griding over p] becomes increasingly dense, we have E[7V P ] — > N [l + din (^f )] • 
This implies that, for any e > 0, we have E[AT p ] > N [l + In (^)] — e for a sufficiently dense gridding 
over p]. Hence, 

E[N p+s -N p ] = E[N p+s ] - E[N p ) < Ndln ( K ( p + S A _jVdln(^) +e = Ndln^ P + & 



Since the argument holds for any small e > 0, we have E[7V p+< 5 — N p ] < Ndln [^^rj- Therefore, 
the density V(p) = lim^o E[N »+*- N p ] < lim s ^ Ndln (^) = M. On the other hand, as the gridding 
gets dense, we have E[N p+5 - N p ] -> Ndln and thus V(p) i y. For p G (0, f ], it follows 

from Theorem 3 that N p is a binomial random variable corresponding to N i.i.d. trials with 
a success probability (^) d . Hence, E[iV p ] = N (^) d and accordingly = ^ This 
completes the proof of the theorem. 
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